Using Structural Information for Identifying Similar Chinese Characters

Authors

  • Chao-Lin Liu
  • Jen-Hsiang Lin
Abstract

Chinese characters that are similar in pronunciation or in internal structure are useful for computer-assisted language learning and for psycholinguistic studies. Although image-based methods can identify visually similar characters, their computational cost can be very high. We propose methods for identifying visually similar Chinese characters by adopting and extending the basic concepts of Cangjie, a proven Chinese input method. We present the methods, illustrate how they work, and discuss their weaknesses in this paper.
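As a rough illustration of the general idea (comparing structural codes rather than images), the sketch below scores pairs of characters by how much their Cangjie code strings overlap. It is not the authors' method: the abstract does not specify their extended codes or their similarity measure. The `CANGJIE` table, the `similarity` and `most_similar` helpers, and the use of Python's `difflib.SequenceMatcher` as the scoring function are all assumptions made for this example, and the code entries are included for illustration only.

```python
from difflib import SequenceMatcher

# Hypothetical lookup table: character -> Cangjie code string.
# Entries are illustrative; a real system would load a full code table.
CANGJIE = {
    "日": "A",
    "月": "B",
    "明": "AB",
    "木": "D",
    "林": "DD",
    "森": "DDD",
}


def similarity(a: str, b: str) -> float:
    """Score two characters by the overlap of their code strings (0.0 to 1.0).

    SequenceMatcher.ratio() is a stand-in measure chosen for this sketch,
    not the measure used in the paper.
    """
    return SequenceMatcher(None, CANGJIE[a], CANGJIE[b]).ratio()


def most_similar(ch: str, k: int = 3) -> list[tuple[str, float]]:
    """Return the k characters in the table whose codes best match ch."""
    scored = [(other, similarity(ch, other)) for other in CANGJIE if other != ch]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]


if __name__ == "__main__":
    for candidate, score in most_similar("林"):
        print(candidate, round(score, 3))
```

Comparing short code strings is far cheaper than comparing glyph images, which is the cost argument the abstract makes. A plain string-overlap score ignores the layout of components within a character, so it should be read only as a starting point.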

Similar resources

RAN: Radical analysis networks for zero-shot learning of Chinese characters

Chinese characters have a huge set of character categories, more than 20,000 and the number is still increasing as more and more novel characters continue being created. However, the enormous characters can be decomposed into a few fundamental structural radicals, only about 500. This paper introduces the Radical Analysis Networks (RAN) that recognize Chinese characters by identifying radicals ...

Visually and Phonologically Similar Characters in Incorrect Simplified Chinese Words

Visually and phonologically similar characters are major contributing factors for errors in Chinese text. By defining appropriate similarity measures that consider extended Cangjie codes, we can identify visually similar characters within a fraction of a second. Relying on the pronunciation information noted for individual characters in Chinese lexicons, we can compute a list of characters that...

A Cognition-Based Game Platform and its Authoring Environment for Learning Chinese Characters

We present integrated services for playing and building games for learning Chinese characters. This work is unique on two aspects: (1) students play games that are designed based on psycholinguistic principles and (2) teachers compile the games with software tools that are supported by sublexical information in Chinese. Players of the games experience and learn the grapheme-morpheme relationshi...

Chinese characters elicit face-like N170 inversion effects.

Recognition of both faces and Chinese characters is commonly believed to rely on configural information. While faces typically exhibit behavioral and N170 inversion effects that differ from non-face stimuli (Rossion, Joyce, Cottrell, & Tarr, 2003), the current study examined whether a similar reliance on configural processing may result in similar inversion effects for faces and Chinese charact...

Radical analysis network for zero-shot learning in printed Chinese character recognition

Chinese characters have a huge set of character categories, more than 20,000 and the number is still increasing as more and more novel characters continue being created. However, the enormous characters can be decomposed into a compact set of about 500 fundamental and structural radicals. This paper introduces a novel radical analysis network (RAN) to recognize printed Chinese characters by ide...

Publication date: 2008